[fbgemm_gpu] Remove `sm_100` and `sm_120` #4024

q10 · 2025-04-25T19:11:02Z

Remove sm_100 and sm_120 from architectures list and keep just sm_100a and sm_120a instead, to enable compilation for FP4 CUTLASS quantization kernels (Enable FP4 CUTLASS GEMM and CUDA quantization kernels #4004), since we are running into the following error:

Instruction 'cvt with .e2m1x2' not supported on .target 'sm_100'

netlify · 2025-04-25T19:11:30Z

✅ Deploy Preview for pytorch-fbgemm-docs ready!

Name	Link
🔨 Latest commit	`35984fd`
🔍 Latest deploy log	https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/6811c7a746ef870008bb4e50
😎 Deploy Preview	https://deploy-preview-4024--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

facebook-github-bot · 2025-04-30T04:25:13Z

@q10 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2025-04-30T05:09:11Z

@q10 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

Summary: X-link: facebookresearch/FBGEMM#1133 - Remove sm_100 and sm_120 from architectures list and keep just sm_100a and sm_120a instead, to enable compilation for FP4 CUTLASS quantization kernels (pytorch#4004), since we are running into the following error: ``` Instruction 'cvt with .e2m1x2' not supported on .target 'sm_100' ``` Reviewed By: spcyppt Differential Revision: D73901832 Pulled By: q10

facebook-github-bot · 2025-04-30T06:45:51Z

This pull request was exported from Phabricator. Differential Revision: D73901832

Summary: X-link: facebookresearch/FBGEMM#1133 - Remove sm_100 and sm_120 from architectures list and keep just sm_100a and sm_120a instead, to enable compilation for FP4 CUTLASS quantization kernels (pytorch#4004), since we are running into the following error: ``` Instruction 'cvt with .e2m1x2' not supported on .target 'sm_100' ``` Reviewed By: spcyppt Differential Revision: D73901832 Pulled By: q10

facebook-github-bot · 2025-04-30T06:48:12Z

This pull request was exported from Phabricator. Differential Revision: D73901832

facebook-github-bot · 2025-04-30T09:33:44Z

@q10 merged this pull request in c7ded05.

Summary: X-link: https://github.com/facebookresearch/FBGEMM/pull/1133 - Remove sm_100 and sm_120 from architectures list and keep just sm_100a and sm_120a instead, to enable compilation for FP4 CUTLASS quantization kernels (pytorch#4004), since we are running into the following error: ``` Instruction 'cvt with .e2m1x2' not supported on .target 'sm_100' ``` Pull Request resolved: pytorch#4024 Reviewed By: spcyppt Differential Revision: D73901832 Pulled By: q10 fbshipit-source-id: 690c58b214aee80374e43a93bf39fe70e430da9a

facebook-github-bot added the cla signed label Apr 25, 2025

q10 force-pushed the bm/test-remove-sm100 branch from f7a44b8 to fdfbe68 Compare April 30, 2025 04:23

q10 changed the title ~~[fbgemm_gpu] Remove sm_100 and sm_120~~ [fbgemm_gpu] Remove sm_100 and sm_120 Apr 30, 2025

q10 force-pushed the bm/test-remove-sm100 branch from fdfbe68 to 12eea6a Compare April 30, 2025 06:45

facebook-github-bot added the fb-exported label Apr 30, 2025

q10 force-pushed the bm/test-remove-sm100 branch from 12eea6a to 35984fd Compare April 30, 2025 06:48

facebook-github-bot closed this in c7ded05 Apr 30, 2025

facebook-github-bot added the Merged label Apr 30, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[fbgemm_gpu] Remove `sm_100` and `sm_120` #4024

[fbgemm_gpu] Remove `sm_100` and `sm_120` #4024

q10 commented Apr 25, 2025 •

edited

Loading

netlify bot commented Apr 25, 2025 •

edited

Loading

facebook-github-bot commented Apr 30, 2025

facebook-github-bot commented Apr 30, 2025

facebook-github-bot commented Apr 30, 2025

facebook-github-bot commented Apr 30, 2025

facebook-github-bot commented Apr 30, 2025

[fbgemm_gpu] Remove sm_100 and sm_120 #4024

[fbgemm_gpu] Remove sm_100 and sm_120 #4024

Conversation

q10 commented Apr 25, 2025 • edited Loading

netlify bot commented Apr 25, 2025 • edited Loading

✅ Deploy Preview for pytorch-fbgemm-docs ready!

facebook-github-bot commented Apr 30, 2025

facebook-github-bot commented Apr 30, 2025

facebook-github-bot commented Apr 30, 2025

facebook-github-bot commented Apr 30, 2025

facebook-github-bot commented Apr 30, 2025

[fbgemm_gpu] Remove `sm_100` and `sm_120` #4024

[fbgemm_gpu] Remove `sm_100` and `sm_120` #4024

q10 commented Apr 25, 2025 •

edited

Loading

netlify bot commented Apr 25, 2025 •

edited

Loading